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POWER AMPLIFIER PRE-DISTORTER TRAINING 

TECHNICAL FIELD 

The present invention relates to digital pre-distortion in power amplifiers 
with memory effects, and especially to pre-distorter training. 

BACKGROUND 

Power amplifiers are known to add more or less distortion to the signal they 
are designed to amplify. The reason for this is that a power amplifier has a 
non-linear input-output signal characteristic. This shows up as a broadened 
spectrum around the desired amplified signal, and as an unwanted inband 
component of the signal. As a counter-measure to decrease the effects of 
non-linearity, it is known to pre-distort the signal at the input of the ampli- 
fier as to give an un-distorted amplified signal at the output of the amplifier. 
This technique is called pre-distortion. Pre-distortion as implemented today 
normally uses a look-up table that is used to multiply the signal. The entry 
into the table is the magnitude of the signal at every time sample. 

Memory effects is another problem related to power amplifiers. Memory 
effects typically show up as a non-symmetrical spectrum around the carrier 
at the output of a power amplifier. That is, although the carrier (desired 
signal) spectrum is perfectly symmetrical, the spurious spectrum coming 
from the distortion may be non- symmetrical with respect to the center of the 
carrier. 

The methods commonly used to handle non-linearity do not in general take 
into account memory effects of the power amplifier. As the term "memory 
effects" indicates, there is a dependence not only on the present sample but 
also on delayed samples of the signal. Thus, a single table approach cannot 
take care of memory effects, but can only handle non-linearity. 
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Lei Ding et. al. [1] inspired by work by Kim and Konstantinou [2] have 
derived a pre-distortion method based on what they call "Memory Polynomi- 
als" that very well model memory effects. However, a drawback of this 
5 method is that it requires recalculation of the memory polynomials for each 

new input signal amplitude, which cam be computationally costly, especially 
if many polynomials of high order are used. A further drawback of this 
method is the computationally costly training procedure used to determine 
the polynomial coefficients. 

10 

Reference [3] describes an amplifier provided with a pre-filter in the form of a 
FIR filter instead of a memory polynomial based pre-distorter. The coeffi- 
cients of this filter are updated with an iterative least mean square algo- 
rithm. 

15 

SUMMARY 

An object of the present invention is to provide a computationally efficient 
training method for pre-distortion based on memory polynomials. 

20 

This object is achieved in accordance with the attached claims. 

The present invention is based on a FIR filter structure including individual 
look-up tables for the filter taps, where each look-up table represents a 
25 discretized memory polynomial. Briefly, the training method in accordance 

with the invention is based on iteratively and individually (independently) 
determining the filter coefficients of each look-up table, preferably by a 
corresponding iterative LMS (Least Mean Square) based procedure. 



30 



WO 2005/053153 



3 



PCT/SE2003/001825 



BRIEF DESCRIPTION OF THE DRAWINGS 

The invention, together with further objects and advantages thereof, may best 
be understood by making reference to the following description taken together 
with the accompanying drawings, in which: 

Fig. 1 is a diagram illustrating the non-linear input-output signal 
characteristic of a power amplifier; 

Fig. 2 is a diagram illustrating the spectrum of the signal amplified by a 
non-linear power amplifier; 

Fig. 3 is a diagram illustrating the input-output signal characteristic of 
a power amplifier pre-distorter for removing the non-linearity in Fig. 1; 

Fig. 4 is a diagram illustrating the input-output signal characteristic of 
a power amplifier provided with pre-distortion; 

Fig. 5 is a diagram illustrating the spectrum of the signal amplified by a 
non-linear power amplifier with memory; 

Fig. 6 is a diagram illustrating discretization of polynomials in accor- 
dance with the present invention; 

Fig. 7 is a block diagram of an exemplary embodiment of a pre-distorter 
suitable to be trained in accordance with the present invention; 

Fig. 8 is a block diagram of an exemplary embodiment of a base station 
including a power amplifier provided with a pre-distorter in accordance with 
Fig. 7; 

Fig. 9 is a flow chart illustrating an exemplary embodiment of the 
training method in accordance with the present invention; 

Fig. 10 is a block diagram of another exemplary embodiment of a base 
station including a power amplifier provided with a pre-distorter in accordance 
with Fig. 7; and 

Fig. 1 1 is a flow chart illustrating another exemplary embodiment of the 
training method in accordance with the present invention. 
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DETAILED DESCRIPTION 

In the following description the same reference designations will be used for 
the same or similar elements throughout the figures of the drawings. 

Before the invention is described in detail, a brief description of the underlying 
problem will be given below. 

Fig. 1 illustrates the non-linear input-output signal characteristic of a power 
amplifier. At low input signal amplitudes the amplifier is almost linear, but 
at higher amplitudes it becomes more and more non-linear until it is satu- 
rated. This non-linearity shows up as a broadened spectrum around the 
desired amplified signal (and as an unwanted inband component of the 
signal), as illustrated in Fig. 2. As a counter-measure to decrease the effects 
of non-linearity, it is known to pre-distort the signal at the input of the 
amplifier to give an un-distorted amplified signal at the output of the ampli- 
fier. This technique is called pre-distortion and is illustrated in Fig. 3. The 
input-output signal characteristic for a pre-distorted power amplifier is 
essentially linear up to saturation, as illustrated in Fig. 4. 

Memory effects is another problem related to power amplifiers. Memory 
effects typically show up as a non-symmetrical spectrum around the carrier 
at the output of a power amplifier, as illustrated in Fig. 5. That is, although 
the carrier (desired signal) spectrum is perfectly symmetrical, the spurious 
spectrum coming from the distortion may be non-symmetrical with respect 
to the center of the carrier. 

There is a theoretical way of designing a pre-distorter that takes care of all 
memory effects. This is called the Volterra series. The Volterra series is an 
extension to the well-known Taylor series, which can be used as a pre- 
distorter for memoiy-less amplifiers. The Volterra series, however, also takes 
into account time-delayed terms that may quite accurately model the pre- 
distortion, and may therefore be used to suppress the distortion spectrum. 
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However, a Volterra series quite rapidly gets large in terms of the number of 
possible terms in the expansion. For example, a polynomial of degree 5 with 
a memory depth (maximum delay) of 5 sample units will give rise to at least 
500 coefficients. 

Since the full Volterra series can not be implemented with reasonable 
complexity, an approximation based on "Memory Polynomials" has been 
suggested in [1]. In this approximation the pre-distortion PD(n) may be 
expressed as: 

PD(n) = £ £ V (« - q)\x(n - qf~ l (1) 

Unfortunately this expression is still quite complicated, and a drawback of 
this prior art method is that the expression has to be evaluated for each new 
input sample x(n). However, as will be shown below, this expression may be 
rewritten into a more suitable form for practical implementation. The 
derivation essentially includes three steps: 

1. Separate the double sum into partial sums including only terms with 
the same delay. This gives: 
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2. Here it is noted that the delayed signals x(n-q) do not depend on the 
summation indices fc. Thus the partial sums may be factorized into: 
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3. Identify the polynomials T q (\x(n-q)\) to obtain: 
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It is noted in (2) that T g (\x(n-q) \) are polynomials in the absolute value of the 
(complex) variable x(n-q). Thus, by multiplying each delayed complex sample 
x(n-q) by a polynomial in \x(n-q) | (which has the same delay q) and summing 
up the products for all delays q, the same end result P(n) as in [1] will be 
obtained. However, this new approach has the advantage that the polynomi- 
als T q may be sampled at appropriate values of \x(n-q)\, as illustrated in Fig. 
6, and stored in look-up tables. This will reduce the pre-distorter to a simple 
FIR filter structure, in which the normally constant filter coefficients are 
replaced by these look-up tables, as illustrated in Fig. 7. The problem 
addressed by the present invention is to determine the values of the polyno- 
mials T q at the sampling points in Fig. 6. 

In the exemplary embodiment of the present invention illustrated in Fig. 7, 
the complex input signal x(n) is forwarded to an absolute value block 10 and 
to a multiplier 12. The absolute value signal from block 10 is forwarded to a 
look-up table LUTO representing a sampled version of polynomial To. The 
corresponding (generally complex) value from look-up table LUTO is for- 
warded to multiplier 12, where it multiplies the input signal sample x(n). 
Input signal x(n) is also forwarded to a delay block D, where it is delayed one 
or several sample periods for forming a delayed sample x(n-l). This delayed 
sample is processed in the same way as the non-delayed sample by an 
absolute value block 10, a multiplier 12 and a look-up table LUT1. However, 
look-up table LUT1 now represents a sampled version of polynomial Ti 
instead of To. As illustrated in Fig. 7, further delays and look-up tables may 
be included. Finally, the obtained products are added to each other in 
adders 14 to form the pre-distorted signal PD(n). Look-up tables used in 
accordance with the present invention make computation in real time much 
more efficient than the polynomial computation for each sample of the input 
signal used in [1J. The look-up tables may be updated (by using the training 
method described below) to keep track of slow changes in the characteristics 
of the power amplifier. 
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Fig. 8 is a block diagram of an exemplary embodiment of a base station 
including a power amplifier provided with a pre-distorter in accordance with 
the present invention. In Fig. 8 elements that are not necessary for under- 
standing the invention have been omitted. The baseband complex signal x(n) is 
forwarded to a pre-distorter 30 in accordance with the present invention. The 
pre-distorted signal is up-converted to intermediate frequency (IF) in a digital 
up-converter 32 and converted into an analog signal in a D/A converter 34, 
which in turn is up-converted to radio frequency (RF) by an analog up- 
converter 36. The RF signal is forwarded to a power amplifier 38, and the 
amplified signal is forwarded to an antenna. The amplified RF signal is also 
forwarded to a feedback down-conversion chain including an analog down- 
converter 40, an A/D converter 42 and a digital down-converter 44. The down- 
converted feedback signal y(n) is forwarded to a trainer 46, which also receives 
the input signal x(n) for determining the look-up tables in pre-distorter 30 in 
accordance with the mathematical principles described below. 

The training method performed by a processing unit 48 in trainer 46 in Fig. 
8 will now be described in more detail. In order to illustrate the training 
process, a pre-distorter 30 including two look-up tables To and Ti (corre- 
sponding to LUTO and LUT1 in fig. 7, with a one sample delay as an exam- 
ple) will be assumed. 

The training procedure is based on measuring a large batch of pre-distorter 
input and feedback signal pairs 

(x n9 y n ), n = l 9 ...,N 

where N is the number of signal pairs in the batch (depending on the 
required accuracy, the number of table entries etc, N is typically between 
2000 and 50 000). The measured sample pairs are stored in a memory 50. 
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The present invention suggests a simple method for determining the lookup 
tables. This method is based on determining the filter coefficients of each 
look-up table individually by a corresponding iterative LMS (Least Mean 
Square) based procedure. An example of such a method is described by the 
5 iterative least mean square based procedures for two tables: 

N b ]Xi]eMb \x k \ 

(3) 

where 

10 i is an iteration index, 

b identifies the table bin, 

pi and ju2 are loop constants associated with each filter tap, 
Nt is the number of stored input signal samples in the batch that have 
an amplitude that falls within a window Mb around the center amplitude of bin 
15 b, 

* denotes complex conjugation. 

From equation (3) it is noted that both lookup tables are iteratively solved by 
equations having the same mathematical form, but by using individual loop 
20 constants and different input samples (xk and Xfc-j, respectively). The real 

positive loop constants jui and fj2 may be equal, but are typically different (a larger 
loop constant for To may increase convergence speed). However, the sum of the loop 
constants should be less than 1 to guarantee stability of the procedure. 

25 The procedure may be repeated until convergence is reached (i.e. until there 

are only insignificant changes to the approximations from one iteration to 
the next). Depending on he required accuracy, convergence is usually 
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)btained after 10-20 iterations. However, if the accuracy requirements are 
elaxed, acceptable tables may be obtained after fewer iterations. 

Thus, by solving equation (3) iteratively as described above, the training 
problem has been reduced to repeatedly solving two equations having the 
form (3), which are only single table problems of lower complexity. 



Equation (3) is valid both for a first time training and for updating the look- 
up tables. If it is assumed that the tables have not been determined yet, To 
10 may, for example, initially be set to 1 and Ti may initially be set to 0. 

However, if the tables are updated, the current tables To, Ti may be used as 
an initial guess instead, since they are probably closer to the optimal tables. 

Fig. 9 is a flow chart illustrating an exemplary embodiment of the training 
15 method in accordance with the present invention for a two-table pre-distorter. 

Step SI sets the elements of tables To and Ti to predetermined values. Step S2 
measures and stores a batch of x and y values. Step S3 determines new 
estimates of To and Ti using equation (3). Step S4 tests whether the tables 
have converged. This can be done, for example, by slimming the absolute 
20 values (or squares) of the differences between corresponding entries of the 

current estimates and the previous estimates and testing whether the ob- 
tained sum is smaller than a predetermined threshold. If the tables have 
converged, the current estimates are provided as the final tables in step S5. If 
the tables have not converged, steps S2-S4 are repeated. 

25 

The outlined iterative method of successively improving table accuracy may 
be extended to include more tables. Further tables, which take into account 
memory effects of other time delays, may be included in the same manner as 
in the two-table solution. In this case the procedures may be written as: 
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(4) 

Thus, for a general delay q one has: 

Ub)-T„ib) + H,.-±r-- £ f^f-^ (5) 

As noted above, the delays q associated with the tables need not be consecu- 
tive integers, but may differ by more than 1 sample period. 

Equation (5) may be simplified by replacing the values \xk^\ 2 in the elements 
of the sum with the average power of the samples in bin b, thereby avoiding 
the divisions. This gives the equation: 



where 
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Here \x b \ denotes the center amplitude of bin b. 

A variation of the above described method is obtained by inserting a post- 
5 distorter, which has the same table structure as pre-distorter 30, in the 

feedback loop. Such an embodiment is illustrated in Fig. 10 where a post- 
distorter 52 has been inserted between elements 44 and 50. During training 
pre-distorter 30 is left unchanged and post-distorter 52 is determined instead. 
When it has been determined the tables in post-distorter 50 have converged, 

10 the tables are copied over to pre-distorter 30. The post-distorter will in this 

embodiment undo the distortion of the amplifier during training, since the 
pre-distorted samples are now amplified (and distorted) and then post- 
distorted back to the pre-distorted samples, which undoes the distortion of the 
amplifier. The same equations may still be used if x denotes the unamplified 

15 but pre-distorted samples and y denotes the post-distorted feedback samples. 

Fig. 1 1 is a flow chart illustrating an exemplary embodiment of the training 
method in accordance with the just described variation of present invention 
for two tables. Step S10 applies the post-distorter model. Step Sll sets the 

20 elements of tables To and Ti to predetermined values. Step S12 measures and 

stores a batch of x and y values. Step S13 determines new estimates of To and 
Ti using equation (3). Step S14 tests whether the post-distorter model has 
converged. This can be done, for example, by summing the absolute values (or 
squares) of the differences between corresponding entries of the current table 

2 5 estimates and the previous estimates and testing whether the obtained sum is 

smaller than a predetermined threshold. If the model has converged, the 
current table estimates are copied over to the pre-distorter in step SI 5. If not, 
steps S 1 3-S 14 are repeated. 

30 In the embodiments described so far an average correction based on a lot of 

samples was used to form the iterative equations. This requires storage of a 
large number of samples and updates all table entries. An alternative method 
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is to update after every sample. In this case only the table entries that corre- 
spond to the bin b into which the current input sample happens to fall is 
updated. For this embodiment equation (5) reduces to: 

\ x «-*\ 

while equation (6) reduces to: 

T vt (b) = T gi . l (b) + M q -<by(x k -y k yxl q : \x k _ q \eM b (9) 

An advantage of this embodiment is that only the latest Q (the maximum 
delay) sample pairs have to be stored in the batch, which requires less 
memory. This approach typically requires only one iteration for every new 
sample, since the current table entries in a stationary state should be very 
close to the optimal values. 

In the above description the training procedure has been expressed in Carte- 
sian form using a complex number representation. An alternative is to perform 
a transformation to polar form and express the procedure in two-component 
form, with one procedure representing the iterations in the radial component 
and one procedure representing the iterations in the phase component. 
Furthermore, in the given examples the delay q has been assumed to be 
positive or zero. However, negative values for q are also possible. For example, 
the zero delay table may be the middle table in the filter structure. 

The described look-up table based pre-distorter may be implemented as an 
FPGA (Field Programmable Gate Array) or an ASIC (Application Specific 
Integrated Circuit). Another possibility is to use a micro processor or a 
micro /signal processor combination and corresponding software. Combina- 
tions of these approaches are also possible. The actual computation of the 
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look-up table entries may be done either in real-time on a sample by sample 
basis or in an off-line manner at a slow update speed, as described above. 

An especially attractive feature of the present invention is that the complex- 
ity growth when tables aire added only is linear instead of quadratic. This 
reduced complexity may be used to increase the table update rate, thereby 
obtaining a more accurate pre-distorter. In fact, by using the procedure in 
accordance with equation (8) or (9) one entry of each table is updated every 
sample period. 

Although the present invention has been described with reference to individ- 
ual LMS based iterative procedures for each table, it is also feasible to use 
other iterative procedures, such as RLS (Recursive Least Square) or Kalman 
based procedures. Although such methods are inherently more complex, 
they have the advantage of a faster convergence rate. 

It will be understood by those skilled in the art that various modifications 
and changes may be made to the present invention without departure from 
the scope thereof, which is defined by the appended claims. 
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